Syllable classification using static matrices and prosodic features
Authors
Abstract
In this paper we explore the usefulness of prosodic features for syllable classification. To do this, we represent the syllable as a static analysis unit, so that its acoustic-temporal dynamics can be merged into a single set of features that the SVM classifier considers as a whole. In the first part of our experiment we used MFCCs as classification features, obtaining a maximum accuracy of 86.66%. The second part of our study tests whether prosodic information is complementary to cepstral information for syllable classification. The results show that combining the two types of information does improve classification, but further analysis is needed to combine the two feature types more successfully.
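The abstract does not say how the static representation is built, so the following is only a minimal sketch of one plausible reading: a variable-length sequence of MFCC frames is resampled to a fixed number of frames by linear interpolation, flattened, and concatenated with a few syllable-level prosodic statistics. The function names, the choice of 10 frames, and the four prosodic features (duration, mean F0, F0 range, mean energy) are all assumptions made for illustration, not the authors' method.

```python
import numpy as np


def static_matrix(frames, n_frames=10):
    """Resample a (T, D) frame sequence to a fixed (n_frames, D) matrix.

    Assumption: linear interpolation along a normalized time axis.
    The paper may build its static matrices differently.
    """
    frames = np.asarray(frames, dtype=float)
    t_src = np.linspace(0.0, 1.0, num=frames.shape[0])
    t_dst = np.linspace(0.0, 1.0, num=n_frames)
    # Interpolate each coefficient track independently.
    columns = [np.interp(t_dst, t_src, frames[:, d])
               for d in range(frames.shape[1])]
    return np.column_stack(columns)


def syllable_vector(mfcc, f0, energy, duration_s, n_frames=10):
    """Build one fixed-length vector per syllable (hypothetical layout).

    mfcc:       (T, D) cepstral frames for the syllable
    f0:         (T,) pitch track in Hz, 0 for unvoiced frames
    energy:     (T,) frame energies
    duration_s: syllable duration in seconds
    """
    cepstral = static_matrix(mfcc, n_frames).ravel()

    f0 = np.asarray(f0, dtype=float)
    voiced = f0[f0 > 0]
    f0_mean = voiced.mean() if voiced.size else 0.0
    f0_range = (voiced.max() - voiced.min()) if voiced.size else 0.0

    # Illustrative prosodic statistics; the feature set is an assumption.
    prosodic = np.array([
        duration_s,
        f0_mean,
        f0_range,
        float(np.mean(energy)),
    ])
    return np.concatenate([cepstral, prosodic])
```

With 13 coefficients and 10 frames, every syllable yields a vector of 10 × 13 + 4 = 134 values regardless of its original length, so the vectors can be stacked into a matrix and passed to an SVM implementation such as scikit-learn's `SVC`. Because the prosodic values are on very different scales from the cepstral ones, standardizing each feature before training would likely matter in practice.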
Similar resources
Design and evaluation of a speech synthesis model based on concatenation of prosodic-context-sensitive units
This paper describes the design and evaluation of prosodically-sensitive concatenative units for a Persian text-to-speech (TTS) synthesis system. The syllables used are prosodically conditioned in the sense that a single conventional syllable is stored as different versions taken directly from the different prosodic domains of the prosodically labeled, read sentences. The three levels of the Per...
Syllable-based Regional Swiss French Accent Identification using Prosodic Features
In this paper an attempt is made to automatically recognize a speaker’s accent among regional Swiss French accents from four regions of Switzerland. To achieve this goal, a syllable-based classification framework is implemented using prosodic features extracted from the speech signal. Since, among these regional accents, the variations in speech mainly originate from the speaking style, i.e., diff...
MFCC and Prosodic Feature Extraction Techniques:
In this paper our main aim is to describe the differences between cepstral and non-cepstral feature extraction techniques. We try to cover most of the comparative features of Mel Frequency Cepstral Coefficients and prosodic features. In speaker recognition, two types of techniques are available for feature extraction: short-term features, i.e. Mel Frequency Cepstral Coefficient (MFCC)...
Importance of Prosodic Features in Language Identification
Earlier research on LID systems found that including prosodic features (such as speech rate, fundamental frequency, and syllable timing) did little to improve system performance. A focused study of the utility of prosodic features is attempted here, evaluating parameters that capture the fundamental frequency and amplitude contours on a syllable-by-syllable basis. The timing relatio...
Prosodic Features in Automatic Language Identification Reflect Language Typology
Results from a prosody-based automatic language discrimination (LID) system suggest that the difficulties reported by other sites in incorporating prosodic information into LID systems may have been caused by their not using appropriate task-specific features. Running averages and correlations of prosodic features capturing syllable pitch and amplitude contours, duration and phrase location wer...